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Abstract 

We analyze a probabilistic algorithm for matching shapes modeled by planar regions un- 
der translations and rigid motions (rotation and translation). Given shapes A and B, the 
algorithm computes a transformation t such that with high probability the area of overlap of 
t{A) and B is close to maximal. In the case of polygons, we give a time bound that does not 
depend significantly on the number of vertices. 

1 Introduction 

The Problem. Matching two geometric shapes under transformations and evaluating their sim- 
ilarity is one of the central problems in computer vision systems where the evaluation of the re- 
semblance of two images is based on their geometric shape and not color or texture. Because of 
its significance the problem has been widely covered in the literature, see [3l [10] for surveys. 

Depending on the application, 2D shapes are modeled as finite point patterns, polygonal chains 
or polygons. Given two shapes A and B, as well as a set of transformations T and a distance 
measure d, the problem is to find the transformation t & T such that t{A) and B match optimally 
with respect to d. Two shapes are considered similar if there is a transformation t such that the 
distance between t{A) and B is small. The problem is well-studied for various settings, e.g., sets 
of line segments, rigid motions and the Hausdorff distance. 

In this paper we consider the problem of matching 2D shapes modeled by plane open sets, 
e.g., sets of polygons, with respect to the area of the symmetric difference, which is the area that 
belongs to exactly one of the shapes. As sets of allowed transformations T we will consider the set 
of translations and the set of rigid motions (rotation and translation) in the plane. Minimizing the 
area of the symmetric difference under translations or rigid motions is equivalent to maximizing 
the area of overlap, so we will consider the latter formulation of the problem for the rest of this 
article. The area of overlap is a well-known similarity measure, and, e.g., has the advantage that 
it is insensitive to noise. Furthermore, computing the maximal area of overlap of two sets of 
polygons under translations or rigid motions is an interesting computational problem on its own. 



Related Work. For simple polygons, efficient algorithms for maximizing the area of overlap 
under translations are known. Mount et al. show that the maximal area of overlap of a simple 
n-polygon with a translated simple m-polygon can be computed in 0{n?m'^) time. Recently, 
Cheong et al. 6J introduced a general probabilistic framework for computing an approximation 
with prespecified absolute error e in 0{m -\- (n^/e**) log(n)^) time for translations and 0{m -f 
{tt' je'^) log(n)^) time for rigid motions. 

De Berg et al. [7 consider the case of convex polygons and give a 0{{n + m) log(n -I- m)) 
time algorithm maximizing the area of overlap under translations. Alt et al. [5] give a linear time 
constant factor approximation algorithm for minimizing the area of the symmetric difference of 
convex shapes under translations and homothctics (scaling and translation). 

*This work was partially supported by the European Union under contract No. FP6-511572, Project PROFI 
and by the DFG Priority Programme 1307 Algorithm Engineering. 
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For higher dimensions Ahn et ah present in jT] an algorithm finding a translation vector max- 
imizing the overlap of two convex polytopes bounded by a total of n hyperplanes in R'' for d > 3. 
Their algorithm runs in 0{n^'^/'^'^^^ log'^~^ n) time with probability at least 1 — n~^'^^\ 

Surprisingly little has been known so far about maximizing the area of overlap under rigid 
motions. 

Overview. We will design and analyze a simple probabilistic matching algorithm, which for 
translations works as follows. Given two shapes A and B, in one random experiment we select 
a point a € A and a point b d B uniformly at random. This tells us that the translation t that 
is given by the vector b — a maps some part of A onto some part of B. We record this as a vote 
for t and repeat this procedure very often. Then we determine the densest cluster of the resulting 
point cloud and output the center of this cluster as a translation that maps a large part of A onto 
B. For rigid motions we consider two different approaches for the vote generation in one random 
experiment. 

We show that the algorithm approximates the maximal area of overlap under translations and 
rigid motions. More precisely, let i°P* be a transformation that maximizes the area of overlap of 
A and i?, and let t* be a transformation computed by the algorithm. Given an allowable error 
s and a desired probability of success p, both between and 1, we show bounds on the required 
number of random experiments, guaranteeing that the absolute difference between approximation 
and optimum \t°^^{A)nB\ — \t*{A)nB\ is at most e\A\ with probability at least p. Here | • | denotes 
the area (Lebesgue measure) of a set. Furthermore, we prove that this algorithm computes a (l+e)- 
approximation of the maximal area of overlap under translations and rigid motions, meaning that 
\t°P\A) n B\ ~ \t*{A) n B\ < (1 + e)\t°P\A) n B\ with high probabihty, if we make a reasonable 
assumption about the input shapes. 

This algorithm is a special case of a probabilistic algorithmic scheme for approximating an op- 
timal match of planar sets under a subgroup of affine transformations. Alt and Scharf [J] analyzed 
another instance of this algorithmic scheme that compares polygonal curves under translations, 
rigid motions, and similarities. 

2 The Algorithms 

2.1 Description of the Algorithms 

Shapes. We consider shapes modeled by open bounded, and therefore, Lebesgue measurable 
subsets of the plane. We always assume the shapes to have positive area. Additionally, we assume 
that there is a method to select uniformly distributed random points from a shape and the density 
function is Lipschitz continuous (see Section [3?2|) . This is the case for sets of disks and for sets of 
polygons, or equivalently, sets of triangles, which probably is the most common representation in 
practice. The idea of the algorithm can be applied to bitmap data as well. 

For a shape represented by n triangles a random point can be generated by first selecting 
a triangle randomly with probability proportional to the relative area of the triangle and then 
selecting a random point from that triangle. 

For an arbitrary Lebesgue measurable set A in the plane if we are given a set of triangles T it 
is contained in, and an algorithm that decides "Is p £ A?" , we can sample uniformly at random 
from y T and discard points that are not in A. The density function is Lipschitz continuous, for 
example, if the shape boundaries are unions of piecewise differentiable simple closed curves. 

General Idea. The idea of the algorithm is quite simple. Given two shapes A and B, repeat the 
following random experiment very often, say N times: Select random point samples of appropriate 
size from each shape and compute a transformation that maps the point sample of one shape to 
the sample of the other shape. Keep this transformation, called a "vote" , in mind. In each step, 
we grow our collection of "votes" by one. Clusters of "votes" indicate transformations that map 
large parts of the shapes onto each other. 
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Every translation can be associated with a point in two dimensional space and every rigid 
motion with a point in three dimensional space. The densest cluster of "votes" is then defined as 
the transformation t* whose ^-neighborhood with respect to the maximum norm contains the most 
transformation points from random experiments for some parameter S 0- Thus, along with the 
shapes A and B we have two additional input parameters: N determines the number of random 
experiments and S adjusts the clustering size. 

This algorithm captures the intuitive notion of matching. Transformations whose (S-neighbor- 
hoods contain many "votes" should be "good" translations since they map many points from A 
onto points from B. Figure [1] illustrates this idea for the case of translations. 



Figure 1: We compare two copies of a square under translations. The area of overlap of t{A) and 
B corresponds to the chance of choosing a point pair {x,y) E A x B such that y — x = t. 



Translations. Observe that two points in the plane uniquely determine a translation that maps 
one point onto the other. Therefore, a point sample for the case of translations consists of one 
randomly selected point of each shape. 

ProbMatchT 

Input: shapes A and B, an integer N, and a positive real i5. 

1. Perform the following experiment N times: 

Draw uniformly distributed random points a G A and b (z B. 
Register the translation vector b — a. 

2. Determine a translation t* whose (5-neighborhood contains the most registered vectors. 
Output: translation t* 

Rigid Motions with Random Angle. The algorithm for rigid motions, ProbMatchRMRA, 
is similar to the algorithm for translations. The space of rigid motions R is given as / x C 
where / — [—1/2, 1/2). We use the interval [—1/2, 1/2) instead of [— tt, tt) because we regard this 
interval as a probability space, which should have measure 1, avoiding a constant in the density 
function. A point (a, t) G R denotes the rigid motion 



For matching under rigid motions, we select in each step uniformly distributed an angle a 
and random points a Cz A and b Cz B. We give one "vote" to the unique rigid motion with 

^In order to make it reasonable to use the same tolerance 5 for both, translations and rotations, it is advisable 
to normalize the translation space. 
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counterclockwise rotation angle a that maps a onto 6, namely the map 

X I— *■ MaX + (6 — Matt). 

Rigid Motions with 3+1 Points. Another variant for rigid motions is the algorithm Prob- 
MatchRM3+1, which does not choose a completely random rotation but prefers directions that 
are present in the shape. A rigid motion is determined by selecting two points ai, 02 in A and one 
point bi in B uniformly at random. Then, we select another point 62 in such that the distances 
between the points in ai and a2 and bi and &2 are the same, i.e., &2 = &i + ||a2 — ai||-^^/3(J), 
where /3 e [—1/2, 1/2) is randomly selected under uniform distribution. If 62 happens to be in B, 
(oi, a2, 61, 62) is a valid random sample. Otherwise, we discard the sample and select new points. 
In this way, we select uniformly distributed tuples from 

S = {{ai,a2,bi,l3) e x B x I : b2 = bi + \\a2 - ai\\MpQ e B}. 
2.2 Main Results 

Approximation Theorems. First, we give bounds on the required number of random experi- 
ments. The main results are the following approximation theorems. 

Theorem 1 (Absolute Approximation). For any two shapes A and B and parameters e,T with 
< e, r < 1 there exist a positive real S = 0(e) and an integer N such that the following holds: 
Let the transformation t* be the output of ProbMatchT or ProbMatchRMRA, respectively, 
let t°P* be a transformation that maximizes the area of overlap of A and B , then 

I \t*[A)C^B\-\t°'^\A)C^B\ I <e\A\ 

with probability at least 1 — r. In the case of translations 

^ = O (c/£6log(max{l/T, c/e})) , 

in the case of rigid motions 

iV 0(C/£^log(max{l/T, C/e})) , 

where c = \B\'^ l^'^ /\Af , C = \B\'^ IS.^ /\Af , A is the length of the boundary of A and D is the 
diameter of A. 

If we know that the shapes we have to match are not too "skinny" we can also bound the num- 
ber of experiments required by algorithm ProbMatchRM3+1 in order to achieve the absolute 
approximation error of at most 

We say that a shape A is K-/af □ for some constant < k < 1 if there exists an inscribed circle 
C C A such that \C\ > k\A\. 

Theorem 2. Let A and B be two n-fat shapes such that the largest inscribed circle in A is at most 
as large as the largest inscribed circle in B. For all parameters e, r with < e, r < 1 there exist a 
positive real 5 = Q{e) and an integer N such that the following holds: Let the transformation t* be 
the output of PROBMATCHRM3-f 1 and t°P* a transformation that maximizes the area of overlap 
of A and B , then 

I \t* [A) r\ B\ - \t°P\A) r\ B\ I <e\A\ 
with probability at least 1 — t. The required number of experiments N is 

^-o(max|^log(max|i ||) , ^log^}) 

where C — \B\'^ /\A\^ , A is the length of the boundary of A and D is the diameter of A. 

^Our definition of K-fatness differs from the standard definition. 
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Further, under the assumption that the shapes are K-fat, we get a relative approximation 
for all three variants of the algorithm from the absolute approximation results. For algorithms 
ProbMatchT and ProbMatchRMRA a weaker assumption that the maximal area of overlap 
of A and B is at least a constant fraction of \A\, i.e., maxt \t{A) n i?| > k\A\ for some < k < 1, 
is sufficient for the relative error bound. Observe that if the two shapes A and B are K-fat and A 
is the shape with the smaller largest inscribed circle, then maxt \t{A) D B\ > k\A\. 

The relative error bound follows if we choose e' — en and apply the absolute approximation 
results: 

\\t°P\A) nB\- \t*iA) nB\\< e'\A\ = eK\A\ < e\t°P\A) n B\. 

Corollary 3 (Relative Approximation). Given two shapes A and B, let t* , i°P', e, 6, t, N be as 
in TheoremUl for algorithms ProbMatchT and ProbMatchRMRA, and as in Theorem\^for 
algorithm ProbMatchRM3+1. Assume that |t°P*(yl) B\ > k\A\ for some constant k in case 
of translations and rigid motions with random rotation angle. For rigid motions with 3 + 1 points 
assume that A and B are n-fat. Then with probability at least 1 — t 

\\t*{A) r\B\- \t°^\A) r\B\\< £|i°p'(A) n b\ 

if N is chosen as in Theorem Q] for algorithms ProbMatchT and ProbMatchRMRA, only 
that now c = \B\'^A'^/\A\'^k'^ and C = \B\'^A'^D'^/\A\'^k^, where A is the length of the boundary 
of A and D is the diameter of A. For algorithm ProbMatchRM3+1 the necessary number of 
experiments is N = O ^max | j^tt log ^max |i, , ^ log i where C is as in Theorem\^ 

Runtime for Sets of Polygons. The runtime of the algorithm consists of the time Tgcn needed 
to generate N random samples and the time Tarr needed to find the transformation t* whose S- 
neighborhood contains the most registered transformation vectors. 

Assume that shapes are sets of polygons, without loss of generality, sets of triangles. A random 
point in a triangle can be generated in constant time using barycentric coordinates. For generating 
a random point from a set of n triangles we select a triangle randomly with probability proportional 
to the relative area of the triangle and then take a random point from the selected triangle. We 
first compute the areas of the triangles and partition the unit interval [0, 1] by subintervals whose 
lengths are proportional to these areas. Then the selection of a random a G [0, 1] and a binary 
search on this partition gives us a random triangle. Thus, we get preprocessing time linear in n 
and O(logn) generation time for a single point. Therefore, Tgcn{n, N) — 0{n + Nlogn). 

Determining a translation whose (5-neighborhood obtained the most "votes" can be done by 
traversing the arrangement A given by the boundaries of the (S-neighborhoods of the N votes from 
the random experiments. The depth of a cell is defined as the number of neighborhoods it is 
contained in. The candidates for the output of the algorithm are the transformations contained 
in the deepest cells in this arrangement because a transformation t lies in the intersection of k of 
the neighborhoods if and only if its neighborhood contains k votes. The size of the arrangement 
is 0{N'^) for translations and 0{N'^) for rigid motions. The deepest cells can be determined by 
constructing and traversing the complete arrangement, which can be accomplished in time 0{N'^) 
for translations and 0{N^) for rigid motions. 

The runtime can be improved if, instead of the deepest cell in the arrangement, an approxi- 
mately deepest cell is computed. If the depth of the arrangement is d, a witness point of depth 
k such that {1 ~ e)d < k < d can be computed in time TarrC^) = 0{Ne'^ \ogN) 0. The total 
runtime of the algorithm is then 0{Ne~^ log N + N log n + n). We will show later that the quality 
of the output can still be guaranteed if we approximate the depth. 

In the following theorem we refer to the probabilistic algorithms as described in Section 12.11 
except that in step 2 of each algorithm a transformation with an approximately largest number of 
"votes" in its (5-neighborhood is returned. 

Theorem 4. Let A and B be two shapes represented by sets of n triangles in total. Let t°P' 
denote the transformation maximizing the area of overlap of A and B For a given error tolerance 
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e and maximal allowed failure probability t with < £, r < 1, the three algorithms described in 
Section \2.1\ in combination with the depth approximation algorithm of jSl compute a transformation 
t'^PP, such that 

I |t^pp(^) n S| - |i°p*(A) n B| I < e\A\ 

with probability at least 1 — t in time 0{c/e^ log(niax{l/T, c/e}) \og{cn/e) + n) for translations 
(algorithm ProbMatchTJ and in time 0(C/£^'' log(max {1/r, C / e})\og{Cn/ s) + n) for rigid 
motions with random rotation angle (algorithm ProbMatchRMRA^. Therein c= |-BpA'*/|yl|^, 
C = |i?pA^Z3^/| A is the length of the boundary of A, and D is the diameter of A. 

For algorithm ProbMatchRM3+1 the shapes A and B are additionally required to be n-fat 
for some < k < 1 . The running time of the algorithm is then 
O (max log (max {i, g^}) , ^log^}), where C = \B\^ A'' Dy\A\^. 

2.3 Overview of the Analysis 

In Section [3] we analyze the probability distribution implicitly given in the transformation space 
by the random experiment. It turns out that in the case of translations and in the case of rigid 
motions where the rotation angle is chosen randomly the density function is proportional to the 
function mapping a transformation vector to the area of overlap of the transformed shape A and 
B. For the rigid motions and algorithm ProbMatchRM3+1 the density function is propor- 
tional to the squared value of the area of overlap. Further, we prove that the density functions 
are Lipschitz continuous. Therefore, the probability of a ^-neighborhood of a transformation t 
converges uniformly to the value of the density function at t times the size of the (5-ncighborhood 
as S approaches zero. 

Then in Section U] we show that the relative number of transformations generated by random 
experiments that are contained in the (5-neighborhood of a transformation i is a good approxima- 
tion of the probability of that (5-neighborhood, in the sense that the probability of a large error 
decreases exponentially in the number of experiments. 

Finally, we combine the uniform continuity of the density functions and the probability ap- 
proximation results to derive rigorous bounds on the number of experiments required to find a 
transformation that with high probability approximates the maximum area of overlap within the 
given error bound. 



3 Density Functions 

3.1 Determining the Density Functions 

In this section we analyze the density functions of the probability distribution induced by the ran- 
dom experiments of the algorithm in the transformation space. We show that for translations and 
for rigid motions with random rotation angle the value of the density function for a transformation 
t is proportional to the area of overlap \t{A) n B\. For the algorithm ProbMatchRM3-|-1 the 
induced density function is proportional to the squared area of overlap. Additionally, we show 
that in all three cases the density functions are Lipschitz continuous. 

For deriving the density functions underlying the random experiments we will use the following 
probability theoretical transformation formula for density functions of random variables, see for 
example 0. 

Theorem 5. Let X : K" ^ M" be a random variable with density function fx, open set G C K" 
be the support of fx cind ip : G ^ R" a continuously differentiate injective map, i.e., ip : G G' , 
where G' = fiG), is a bisection. Let A(x) = det(^^(x))i.j=i^....„. Then ip o X has the density 
function 

, {fx{v~\y))\A{^-\y))\-^ for y EG' 

fvox(y) ~ < rf m ■ 

fory^G' 
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For translations and rigid motions with random rotation angle we can apply the following 
special case: 

Corollary 6. Let X : M" R" be a random variable with density function fx and h : R" 
R", h : X Mx a linear map with det(Af ) 7^ 0. Then ho X has the density function 

fhoxiy) = fx{M-'y) |det(M-i)|. 

For a subset i? of a set R let xb ■ R — * {0, 1} be the characteristic function of B that is 1 if a 
point from i? is in i? and otherwise. 

The Density Function for Translations. 

Lemma 7 (Translations). The density function of the probability distribution on the translation 
space that results from the experiment in algorithm ProbMatchTrans is given by 

gT{t)^\t{A)C^B\/{\A\\B\) . 

Proof. We model the experiment by regarding X — idAxB on R^ as uniformly distributed random 
variable. X corresponds to the sample pairs selected by the random experiment. The density 
function of X is 

f , ,N XA[a) XBjb) 
^^("'^^^ \A\\B\ ■ 

Consider the bijective function (/j : R'' — > R'', (p : (a, b) 1^ {a,b — a), ip maps a pair of points (a, b) 
to a point-translation pair (a,t) where t is the translation that maps a to b. By Corollary E] the 
density function oi ip o X \s 

r ( ^\ ^ XAja) XB{a + t) _ XAn{B-t){a) 
J^^^^"^^'- \A\\B\ ~ 

The density function on the translation space R^ is the density function of the projection of ipo X 
to the last two coordinates: 



\A r]{B~t)\ _ \t{A)nB \ 

\A\\B\ -^A\\B\ 



9T(t) = I fy,cx(a,t)da = TTTTT?! = t Attnt ■ ^ 



The Density Fiinction for Rigid Motions with Random Angle. 

Lemma 8 (Rigid Motions with Random Angle). The density function on the space of rigid 
motions R induced by algorithm ProbMatchRMRA is given by 

gnA{r)^\r(A)nB\/{\A\\B\). 

Proof. Our random experiment consists in selecting uniformly distributed points from il = / x 
Ax B where / = [—1/2, 1/2). We are interested in the density function /y of the random variable 

Y -.n^ R, Y : {a,a,b) {a,b- M^a) . 

We will express the density function of Y in terms of the conditional probability densities of the 
following two random variables Yj and Yr defined as 

Yj : fl ^ I, Yj : (a, a,b) i-^ a , 

Yt -.fl-^ R2, Yt : (a, a,b)^b- M^a . 

The density function of Y is the joint density of the random variables Yj and Yr. Recall that 
the counterclockwise rotation angle is selected uniformly distributed in / independently from the 
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points a and b. So the marginal probability density of Yj, i.e., probability density of Yj — a 
allowing all possible values of Yr , is 



fiia) - ^ = 1 . 

The value of Yr depends on the selected points a and b and on the value of Yj . The conditional 
probability density of Yt ~ t given Yj — a is exactly the probability density in the space of 
translations for shapes Ma A and B : 

Mt\Yj^a)^ \Am\ ■ 

The conditional probability density can also be expressed in terms of the joint probability density 
fxit \ Yj = a) = /y'(a, t)/ fi{a). Thus we get for any rigid motion r = (a, t) that 

9BA{r)^fY{r) = \r{A)nB\/i\A\\B\) . □ 
The Density Function for Rigid Motions with 3+1 Points. 

Lemma 9 (Rigid Motions with 3+1 Points). The density function on the space of rigid motions R 
induced by the algorithm ProbMatchRM3+1 is given by 

gs+i{r)^\r{A)nB\yc , 

where c is a positive real depending on A and B which is at most \ A\'^\B\. 

Proof. In one random experiment we select uniformly distributed random elements from the set 

S = {{ai,a2,bi,P) e X B X I : ai ^ a2,b2 = bi + ||a2 - aipZ/jQ G B} . 
The density function of the random variable X = idg is then 

Xs{a,i,a2,bi,l3) 



fxiai,a2,bi,(3) 



XA{ai)XA{a2)XB(bi)xB{b2) 
\S\ 



where 62 is as in the definition of S. 

Let Y denote the random variable corresponding to the rigid motion resulting from one random 
experiment: 

Y : S ^ R , Y : {ai,a2,bi, P) 1-^ {a,bi — MaOi) , 

where a = Z(a2 — ai, 62 — 5i)/27r. We represent y as a composite function of random variable X. 
Define functions ipi,f2 as follows: 

ifi : S ^ X I (pi : (01,02,61,/?) i-> (ai,a2,6i - Maai,a) 

Lp2 ■ R^ ^ I ^ R (p2 ■ {ai,a2,t,a) {a,t) . 

Then Y — ip2 o fi o X . 

Observe that the set S is open, since the sets A and B are open, the excluded set of tuples 
where ai = 02 is a closed set, and the interval / is equivalent to a unit circle and is therefore 
open. Further, the function ipi and its inverse are bijective and differentiable, so we can apply 
Theorem[5]to ipi o X . We first compute the determinant A(a;) of the Jacobian matrix of ipi. It is 
easy to see that det(^^)ij=i,... ^ = det(^^)ij=5,...^7 since (pi,i(a;i, . . . , ^7) = for i = 1, . . . , 4. 
Note that the angle a does not depend on bi and it depends linearly on /3: a = /3 — 7, where 
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7 = Z(a2 - ai, (J))/27r. Therefore, ^^'^ g,^;"^^' = <5y, ^ = for j = 1,2, and || = 1. Now we 
have that 

'I Ai^ 

1 A2 

"J/ »,i=5,...,7 VO 1 



di£i 



for some Ai,A2 € ffi. Thus, A(x) = 1. The inverse function of (pi maps a tuple (ai,a2,t, a) to 
(ai, a2, i + MqCi, a + 7). By Theorem [5] we get that the density function oi tpio X is 

/i^oids(ai: "2, a) = XA{ai)xA{a2)XB{t + Maai)xB{t + Maa2)/\S\ 

= XA{ai)XA{a2)XM^^{B~t){ai)XM^^{B-t){a2)/\S\ . 

The density function of random variable y on i? is then 

g3+i{a,t) = 1/\S\ / X(Anr-i(S))2(ai,a2)d(ai,a2) 

^\Anr-\B)f/\S\ 
^\r{A)nB\y\S\ , 

where r — (a, i). □ 



3.2 Lipschitz Continuity of the Density Functions. 

In this section we show that the density functions of the probability distribution induced in the 
space of transformations by the algorithm are Lipschitz continuous. 

A function h from a metric space M to M is called Lipschitz continuous if there is a constant 
L such that for all x,y E M holds 

\\x - y\\ < 5 =^ \h{x) - h{y)\ < L5 . 

Let B^g denote the (5-neighborhood of a transformation t with respect to the maximum norm and 
let /i5 be the Lebesgue measure of B\. We are interested in the density functions to be Lipschitz 
continuous because then P{Bg)/fis converges to h{t) ioi S ^ uniformly on the transformation 
space, where P is the probability distribution with density function h. 

Lemma 10. For fixed shapes A and B let h be the density function of the probability distribution 
P in the transformation space induced by the probabilistic algorithm. There exists a constant L 
such that for every transformation t 

\P{Bl) ^ pLsh{t)\ <Lpis5 , 

where Bg denotes the 6 -neighborhood of a transformation t with respect to the maximum norm and 
fis is the Lebesgue measure of B\. 

For translations L — Lt — \/2A/(|yl| \B\), for rigid motions and algorithm ProbMatchRMRA 
L = Lra = (V2 + 27r£))A/(|A||i3|), and for rigid motions and algorithm ProbMatchRM3+1 

min(|A|, |i?|)/c, where D is the diameter and A the boundary length 
of A, and c is the constant from Lemma [Pl 

Proof. Assume that h is Lipschitz continuous with constant L then 

P{BI) ^ / h{x)dx < fis sup h{x) < fj,sih{t) + LS) and 
JbI x<^B\ 

I h{x)dx > fis inf h{x) > iJs{h{t) — L5) . 

JbI 

It remains to show that for every variant of the probabilistic algorithm the induced probability 
density function is Lipschitz continuous. Let f{r) denote the area of overlap r{A) n B for a rigid 
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motion r. Wc first show that the function / is Lipschitz continuous. We assume without loss of 
generahty that the input shapes A and B contain the origin. 

Let r = (a,p), s = q) be rigid motions whose distance is less than 5 in the maximum norm. 
Then the distance between translation vectors is ||p — 9|| < V^S and |a — /3| < S. 

The difference in the area of overlap for r and s can be bounded by the area of the symmetric 
difference between r{A) and s{A): 

f{r) ^ \r{A) n B| = l(^(^) \ n S| + \{r{A) n s{A)) n B\ 

< \r{A)\s{A)\ + \s{A)nB\ . 

Thus, /(r) - /(s) < \r{A) \ s{A)\. Similarly, f{s) - f{r) < \s{A) \ r{A)\. Combining these two 
estimates we get 

|/(r)-/(s)|<max(|r(A)\s(A)|, \s{A)\r{A)\) . 

> 

Let w{S) be the maximal length of the line segment r{x)s{x) for x € A. The difference in the area 
of overlap is at most Aw{S) since r{A) and s{A) differ by at most a ?x;((5)-wide strip along the 
boundary of r(A). Next we find an upper bound on the length w{S). 

By an easy geometric argument the distance between a point x and its rotated image can be 
expressed as ||x — MqxH = 2||x|| • | sin(27rck/2)| for < |27ra| < tt/2. Since |sin7| < I7I for all 7 
this distance can be bounded by ||a; — Mq-xH < ||x|| • |27ra|. 

Let x ^ A then | jxl | < D. We use the above argument to bound the distance between the image 
of a; under rigid motions r and s. Observe that since S is small we can assume that < \2Tr6\ < 7r/2. 

||r(a;) -s(a;)|| = \\Mo,x + p ~ Mpx - q\\ 

< \\Mo,x- M0x\\ + \\p-q\\ 

< ||M„_;3:e|| + V2,5 

< 27r|a-/3| • ||x|| +\/25 

< 2n6D + V2(5 = (V2 + 2t:D)S 

for all X in A. 

Then for arbitrary rigid motions r and s such that ||r — s|| < 5 the difference in the area of 
overlap can be bounded by — /(s)| < (a/2 + 2TrD)SA. The Lipschitz constant for the area of 
overlap is then Lf = {V2 + 2ttD)A. 

The density function of the probability distribution in the space of rigid motions induced by the 
algorithm ProbMatchRMRA is gRA{r) = f{r)/{\A\\B\) by Lemma[Hl Then for rigid motions r 
and s such that ||r — s|| < (5 we get 

\9BAr) - 9RAis)\ = - f{s)\/{\A\\B\) < {V2 + 27:D)AS/i\A\\B\) . 

Thus, the Lipschitz constant of the function gj^A is L^a — (\/2 + 27r_D) A/(| A| 

The density function induced by the algorithm ProbMatchRM3+1 is g3+i{r) = P{r)/c, 

where c is the constant from Lemma IHl 

Observe that if / is bounded and Lipschitz continuous with Lipschitz constant Lj, then is 

also Lipschitz continuous with constant Lp = 2Lf sup^, f{x) due to the following consideration: 

\fix) - f{y)\ = \f{x) - fiy)\ ■ |/(x) + fiy)\ < Lj ■ \x - y\ ■ |/(x) + /(y)| 
< 2L/|x- y|sup/(z) 

Z 

= 2{V2 + 27rD) A\x -y\ sup f{z)/c . 

z 

Thus, the Lipschitz constant of the function (73+1 is 

L3+1 = 2(V2 + 27ri:>) A sup /(z)/c . (1) 
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The maximal possible area of overlap of two shapes under rigid motions is clearly bounded by 
the area of the smaller shape. Therefore, the function 53+1 is Lipschitz continuous with constant 
L3+1 < 2(v^+27ri:))Amin(|A|,|B|)/c. 

In the case of translations we can disregard rotation, so the Lipschitz constant L^a = (\/2 + 
27r£') A/(|yl| |i?|) for the density fimction in the case of rigid motions reduces to Lt = \/2A/(|A| |_B|) 
for translations. 

Note that the constants depend heavily on the shapes. □ 

4 Absolute Error Approximation 

In the previous section (Lemma I10[) we showed that for the probability distributions in the space 
of transformations induced by the algorithms for translations and for rigid motions the value of the 
probability function P for a (5-neighborhood Bg (divided by the measure fis of the (5-neighborhood) 
of a transformation t converges to the value of the density function h{t) for that transformation 
as S approaches zero. 

In this section we prove that the relative number of "votes" in the (5-neighborhood of a transfor- 
mation Ms a good approximation of the probability P{Bg) and complete the proofs of Theorem[l] 
and Theorem m 

Let the random variable X^{t) denote the number of registered transformations in the S- 
neighborhood of transformation t. We use the Chernoff bound formulated as in [S] for proving 
that for fixed t with high probability Xf^{t)/N and P{Bg) do not differ much for large N. 

Theorem 11 ([6J Chernoff bound). Let Xi, . . . , Xn be independent binary random variables, let 
X — X]i=i '^'^'^ let < e < 1. Then 

P{\X ~ E{X)\ > sN) < 2e'''^/\ 



Applying this bound to our setting yields 
Corollary 12. For each transformation t and for all < s < 1 

Pi\Xf,it)/N - P(B*)| > e) < 2e-^'^/2. 

Proof. Define Xi = Xb* for i = I, . . . , N . The Xi are identically distributed, independent, binary 

random variables with X^(t) = J^'Li and E{Xff{t)) = NP{Bl). The inequahty of Corollary[12] 
results from applying the Chernoff bound to Xi, . . . , Xjy. □ 

This shows that Xf^{t)/N converges to P{Bg) in probability as the number of random experi- 
ments goes to infinity for each t in the transformation space. 
We have already seen that for fixed t 

Xf,{t)/N .''^ P{Bl) , h{t) . 

Ill prob. unilormly 

Now we need to analyze what happens if the transformation vector is determined by the sequence 
of random experiments, namely a vector whose 5-neighborhood obtains the most "votes" , and 
thus is a random vector itself. 

The output of the algorithm can be modeled as a random variable 

Z% = maxX^(i). 

Let S ~ (si, . . . , Sat) be a sequence of transformations from the random experiments. Consider the 
arrangement A induced by the boundaries of B^^ , ■ ■ ■ , Bg" , which are the (5-spheres with respect 
to the maximum norm of the points in S. The depth of a cell is defined as the number of Bg^ it is 
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contained in. The candidates for the output of the algorithm are the transformations contained 
in the deepest ceUs in this arrangement. A transformation t lies in the intersection of k of the 
neighborhoods if and only if its neighborhood contains k "votes" . 
The next lemma can be proven using an idea of [B]. 

Lemma 13 (Key Lemma). Let V be the set of all vertices of the arrangement A and the 
transformation maximizing the area of overlap \t{A) D B\. Then for each e > and N > 6/e + 2 
holds in the case of translations 

P{3t e y U {t°P*} : \X%{t)/N - P{Bl)\ >e)< 2Ar2e-^'(^-2)/8 ; 
and in the case of rigid motions, 

P{3t e y U {t°P*} : \X%{t)/N ~ P{Bl)\ >e)< |Ar3e-^'(^-3)/8 . 

Proof. In the case of translations, a vertex of A is defined by 2 ^-spheres, and 2 (5-spheres define 
at most 2 vertices, so \V\ < N{N — 1) and \V U {t°P*}| < N^. In the case of rigid motions, a 
vertex of A is defined by 3 (5-spheres, and 3 (5-spheres define at most 2 vertices, so \V\ < 2(^) = 
- + 2N/3 and |t/ U {t°P'}| < 
Let us consider translations first. Let t be a vertex of A, then t is an intersection point of two 
(5-spheres B^,Bf for some votes v^w. Consider same random sequence without two experiments 
yielding v and w. Since the random points are independent and the choice of t does not depend 
on the shorter random sequence, we can apply Corollary [T^ to t with iV — 2 experiments. 

P{\X%_^{t)/{N - 2) - P{Bl)\ > e/2) < 2e--'(^-2)/8 . 

For each (5-neighborhood, the fractions of "votes" that lie in the neighborhood differ by at most 
2/{N — 2) if two arbitrary "votes" are deleted. Therefore, 

\Xf,it)/N - PiBl)\ < \Xf,_,{t)/{N - 2) - PiBl)\ + 2/iN - 2) . 

Choosing N >A/e + 2we get that 2/{N - 2) < e/2 and 

P{\X%{t)l{N) - P{Bl)\ >e)< 2e-^'(^-2)/8 . 

As argued above there are at most N"^ such vertex points together with the point t°P*. Applying 
the triangle inequality we get the claim of the Lemma for the case of translation. 

The argumentation for rigid motions is analogous. □ 

Next we show that if the probability density function h is Lipschitz continuous then the value 
of P{B\) for two transformations that lie in one (5-neighborhood does not differ much. 

Proposition 14. Let r,s,t be transformations such that r, s G _B^. Then 
\P(Bg) ~ P(Bg)\ < 4:Lfis6 where L is the Lipschitz constant of the density function h of P and pis 
is the Lebesgue measure of a B\. 

Proof. 



h(v)dv — / h{v)dv 



\PiB^s) - P{BI)\ = 

< iLfisS . □ 



< /i5(sup h{v) — inf h{v)) 
Bi BI 



It follows from Lemma [T3] and Proposition [H] that with high probability for every trans- 
formation t the relative number of votes in its (5-neighborhood is a good approximation of the 
probability P{Bl). 
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Corollary 15. Let < £ < 1 and N > 6/e + 2. For all transformations t with probability at least 
1 — 2N'^e~^ (N-2)/s g^gg translations and with probability at least 1 — |A^^e~^ {N-3)/8 

in the case of rigid motions it holds 

\X'j,{t)/N-P{Bl)\<e + ALfi5S , 

where L is the Lipschitz constant of the probability density function. 

Proof. Let t be an arbitrary transformation and let t' be a vertex of the cell in A which contains 
t, such that X^(t') = Xl^{t). Then 

- < \XW) P{Bi)\ + \P{Bt) - P{Bl)\ 

< e + ALSfis by Lemma [T51 and Proposition [T3] . □ 

That means that the output of the algorithm does not need to be chosen among the vertices 
of the arrangement A. 

Next we show that if the probability density function is Lipschitz continuous then the trans- 
formation with the maximum number of "votes" in its (^-neighborhood results in a good approxi- 
mation of the maximum of the density function with high probability. 

Lemma 16. Let h(t) be the density of the probability distribution P in the transformation space 
induced by the algorithm, and L the Lipschitz constant of h. Let i°P* be the transformation max- 
imizing h{t) and t* the transformation maximizing Xff{t), where Xff(t) denotes the number of 
transformations generated by the random experiments that are contained in the d -neighborhood 
oft. The Lebesgue measure of a S -neighborhood is denoted by ^s- Then for all < e < 1 

h{t*) > /i(i°P') - 2e/^5 - 6W 

with probability at least l~q, where q = 2N^e~^ {N-2)/8 translations and q = lA^^e""^ (Af-3)/8 
in the case of rigid motions. 

Proof. It follows from Lemmas [TU] and [T3] and from Corollary [1^] that with probability at least 
1 — q the following two statements are true: 

\^ish{t*) X%{t*)/N\ < \f,sh{t*) - P{Bt)\ + \P{Bl')- Xl,{t*)/N\ 

< LS^is + e + ALfisS by Lemma ITOl and Corollary [15] 

< e -I- 5L^sS 

and 

\fxshit°P') - X^(t°P*)/7V| < \fish{t°P') - P(i?r)| + \P{Br') - X^(i°P*)/iV| 

< LfigS + e by Lemmas ITOl and fT3l . 

By the definition of t* , Xff{t*) > X^(i°P'). Then 

hit*) > {Xf,{t*)-e-5LfisS)/^is 

> (X^(t°P')-e-5W)/Ai5 

> {nsh{t°P') - e - L^isS - e - 5LfisS)/fis 
= /i(t°P*) - 2e/iJs - QLS , 

which concludes the proof. □ 
Now we can prove Theorem [TJ 
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Proof of Theorem[I[ Let f{t) denote the area of overlap t{A)nB, h{t) the density of the probabihty 
distribution P in the transformation space induced by the algorithm, and L the Lipschitz constant 
of h. We will use two parameters < S,ri < 1, which will be determined later. The parameter 
5 specifies the size of the neighborhoods used by the matching algorithms, and i] is the error 
tolerance value used when applying lemmas or corollaries. 

Recall that t* is a transformation with the maximum number of transformations generated by 
the random experiments in its (S- neighborhood and the transformation maximizing f{t). Let 
q denote the failure probability from Lemma [13] in terms of the number of experiments N and the 
error tolerance 77, for translations q — 2N'^e^'^ {n-2)/8 g^^^^ j-\g[(\ motions q = ^N^e~^ (n-3)/8_ 

Since \t*{A)nB\ < \t°P\A) n B\ holds by definiton, we only have to show that \t*{A)r\B\ > 
\t°P^{A) n B\ — s\A\. By LemmadHlfor translations and rigid motions 

h{t*)>h{t°P^)-2ri/^is-(iLS (2) 

with probability at least 1 — q. 

Translations: In the case of translations we have by Lemma [7] h(t) — \t{A) n = 
f{t)/\A\\B\, L = V2A/\A\\B\ by LemmaHOl and us = 4(5^. Then by Q 

4o^ 

Bounding this error to be at most e\A\, we get 

452^ 24v^(53A 



2\A\\B\ 2\A\\B\ 



In order to maximize this expression we differentiate it with respect to S and determine the value 
of S for which that derivative is zero. The value of S maximizing the above expression and the 
corresponding value of rj are then 

Finally, we want the probaility of failure q = 2N^e~^ (n-2)/8 most r. A straightforward 

analysis shows that for N > 80/ri^\og{80/ri^) this value is at most (JV-2)/i6 Solving the 
inequality e"** (^^2)/i6 ^ ^ with respect to N we get that for N > max{— 16/77^ log(l/r) + 
2, 80/77^ log(80/772)} the approximation error is at most e\A\ with probability at least 1 — r. 

Rigid motions with random rotation angle: Here we have by Lemma|S]/i(t) = |<(A)ni?|/|A||_B| — 
f{t)/\A\\B\, L = {V2 + 2ttD)A/\A\\B\ by Lemma [TUl and ^5 = 8S^. Then by © 

/(i°P') - /(t^PP) < + 6(V2 + 2ttD)AS . 

Again, bounding this error by ej A| and maximizing the expression for r/ as in the case of translations 
we get that the optimal choice for 5 and r] is 

6 = ^ and ri = ^ — . (4) 

8(V2 + 27ri:>)A 512(V2 + 27ri:')3A3|B| ^ ^ 

By a straightforward analysis the probability of failure q — |A^3g-?)^(Af-3)/8 g^-j- ^ost e^''^^^"^)/^^ 
for aU TV > 112/r;2 log(112/?72). Solving the inequality e-'''(^-2)/i6 < ^ ^j^j^ respect to N we 
get that for all N > max{— 16/77^ log(l/T) + 3, 112/77^ \og{112 / rj^)} the approximation error of the 
algorithm is at most e\A\ with probability at least 1 — r. □ 

Next we prove Theorem [5] in a similar fashion. 
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Proof of Theorem[^ Since in algorithm ProbMatchRM3+1 some random samples are rejected 
and, therefore, do not induce a vote in the transformation space, we first determine the necessary 
number of not rejected experiments, in the following denoted by M, in order to guarantee the 
required error bound with high probability. Afterwards, we determine the total number N of 
random samples that the algorithm needs to generate in order to record at least M votes in the 
transformation space with high probability. 

Let f{t) denote the area of overlap t{A) OB, h{t) the density of the probability distribution 
P in the space of rigid motions induced by algorithm ProbMatchRM3+1, and L the Lipschitz 
constant of h. 

Let t* be a transformation with the maximum number of transformations generated by the 
random experiments in its (5-neighborhood and t°P* the transformation maximizing f(t). Let q 
denote the failure probability from Lemma [13] in terms of the number of votes M and the error 
tolerance rj, q = |M3e-'''(^-3)/8^ 

Since \t*{A) Ci B\ < \t°P\A) n B\ holds by definition, we only have to show that \t*{A) n B\ > 
\t°P\A) nB\- e\A\. By Lemma [Il]/i(t*) > /i(t°P') - 2r]/^is - GLS with probability at least 1 ~ q. 

For algorithm ProbMatchRM3+1 the density function h{t) — f^{t)/c, where c is the con- 
stant from LemmaEl The Lipschitz constant is L = 2(-\/2 + 27r£))A min(|yl|, \B\)/c by Lemma fTUl 
where D is the diameter and A the length of the boundary of A, and the size of the ^-neighborhood 
is Us = 8S^. In the proof of Lemma [TUl we actually showed a stronger bound on the Lipschitz con- 
stant (Equation (P)), namely L3+1 = 2(V2-h27ri:))A sup^ f{z)/c, which is 2{V2 + 2'k D) A f(t°P^)/c. 
This stronger bound is used in the computations below. 

Additionally, since the shapes A and B are At-fat, and the shape A is the one with the smaller 
largest inscribed circle, the maximal area of overlap of A and B under rigid motions is at least as 
large as the area of the largest inscribed circle in A, that is, /(i°P*) > k\A\. 

By Lemma [12] we get that with probability at least I — q 

-(/^P') - f{t*))if{t°^') + fit*)) = - 

c c c 

= h{t°p^) - h{t*) 

2ri 



Hence, 



, 6S2{V2 + 2TrD)Af{t°P') 



2r]c 6S2{V2 + 2iTD)Af{t°'P^) 



53(/(t°P') + /(t*)) /(i°P*) + /(t*) 



rjc 12S(V2 + 2nD)Af(t°P^) ^, 

< .^^1 .1 + 12<5(V2 + 2^i^)A (since /(t°P*) > k\A\) 

< ^^^^^ + 12^(\/2 + 27ri:')A (sincec< lAplBl byLa.[9l) . 

Restricting this error to be at most s\A\ and maximizing the expression for 77 we get that the 
optimal values for b and t] are 

/i-^i .„d -ifi^ ^ (5) 

\<6{^/2^2ttD)A 163|B|(^/2 + 27^i:')3A3 ^' 

The probabihty of failure is g = |A/3e-'''(^^-3)/8 < g-r,2(M-3)/i6^ Analogous to the proof of 
Theorem [T] we can show that q < t/2 for aU M > max{-16/r/2 log(2/T), II2/772 log(112/?72)}. 
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Thus, if the number of recorded votes is 



where C = \B\'^ /S!" /\A\^ , A is the length of the boundary of A, D is the diameter of A, and 
k is an appropriate constant, then the area of overlap determined by the algorithm differes by at 
most e\A\ from the maximal area of overlap with probability at least 1 — t/2. 

Next we determine the total number N of random samples that the algorithm needs to generate 
in order to record at least AI votes with probability at least 1 — t/2. For that purpose we first 
determine the probability that one randomly generated sample is not rejected. 

Let Ca, and Cb denote the largest inscribed circles in A and B, respectively. And let r^,rs 
denote the radii of Ca and Cb- By the definition of K-fatness \Ca\ > k\A\ and \Cb\ > ^iSj, and by 
a precondition of the theorem < re- 
consider a circle of radius contained in A and a circle C'g concentric with Cb oi radius 
rB — rA/2 > TB/'i- in B. The area of is at least k|A|/4 and the area of is at least K|i?|/4. 
Then the probability that two randomly selected points ai and 02 from A are both contained in 
is at least (k/4)^. The distance between ai and 02 is at most r^/2. The probability that a 
randomly selected point bi from B is contained in C'^ is at least k/4, and by construction, the 
complete circle centered at 61 with radius equal to the distance between oi and 02 is completely 
contained in Cb C B. Therefore, for every choice of two points in C a and a point in C'^ and 
for every randomly chosen direction the random sample induces a vote in transformation space. 
The probability that one random sample is not rejected is then at least as large as (k/4)'^. In the 
following this probability is denoted by p. 

Our algorithm generates in every step random samples independently. For each sample the 
probability not to be rejected is at least p. Then the expected number of valid samples after N 
steps is at least pN . Let X denote the number of valid samples after N steps. Using the Chernoff 
bound (Theorem llip we can determine the number of steps N for which X is not much smaller 
than pN with high probaility: 

P{X <pN - ^N) < 2e"«'^/2 , 

for aU < ^ < 1. For ^ = p/2 we have P {X < §7V) < 2e~p'"/^ Restricting this failure 
probability to be at most r/2 we get that for N > 8p~^ ln(4/r)) the number of votes X is at least 
pN/2 with probability at least 1 — t/2. Then with N > 2M/p random samples the algorithm 
geenrates at least M votes with probability at least 1 — t/2. 
Finally, choosing 

A^>max|^, ^ln(4/T))|=max|fci.^log(max|i, , fc2 • ^ log i} 

with appropriate constants fci, fc2, we get that with probability at least (1 — t/2) the number of 
recorded votes ist sufficiently large, and therefore, with probaility at least (1 — t/2)^ > 1 — t the 
approximation of the maximum area of overlap differs from the optimum by at most e\A\. □ 

Proof of Theorem [7J For a given arrangement A and an error tolerance e the depth approximation 
algorithm of [5] finds a point t^^^^ such that the depth of t^PP is at least e times the maximum 
depth in A. It remains to show that with high probability the area of overlap induced by i^PP is 
a good approximation of the maximal area of overlap. 

Let h{t) be the density of the probability distribution P in the transformation space induced 
by the algorithm, and L the Lipschitz constant of h. Let A be the arrangement of (5-neighborhoods 
of the transformations resulting from the random experiments for some parameter 5 > which 
will be specified later. Additionally we will use an error tolerance value 77 > when applying 
lemmas or corollaries. The parameter rj will also be computed later. 
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Let t* be a transformation with maximum depth in the arrangement A, t^^^ the transformation 
approximating the maximum depth, i.e., Xf^{t'^PP) > (1 — r])Xf^{t*), and the transformation 
maximizing f{t). And let q denote the failure probability from Lemma [T51 

Applying Lemma [TU] and Corollary [12] to t^PP and error tolerance rj we get 

Mt-PP) - X'^it-PP)/N\ < Me^n - PiBDl + \PiBD - X^(i"PP)/iV| 

<rj + BLusS (6) 

and, likewise, by Lemmas fTOl and fTSl 

- Xi{t°'P')/N\ < LiJL.S + ry . (7) 

Since |t'^PP(A) n B| < |i°P*(A) n B\ holds by definition, we only have to show |t'^PP(yl) n B| > 
\t°'^\A) n B\ - e\A\. 

Consider Case 1: X^li'^PP) > X%{t°'P^). Then 

/i(i-PP) > (X^(i^PP) - - 5L^isS)/fis by ® 

> (X^(i°P')-7/-5LAi5<5)/M5 

> {fish{t°P') ^Tj- LfisS - 77 - 5LfisS)/fis by © 
/i(i°P') - 2tj/ fis - 6Ld . 

If X^(i^PP) < X^(t°P*) (Case 2) we have that 

X^(i^PP) < Xf,{t°P') < Xiit*) and |X^(i^PP) - X'j,{t*)\ < vXHn . 

It follows that also |X^(t^PP) - Xf^{t°P^)\ < riX%{t*). 

Now we can bound the difference |//5/i(t^PP) — ^5ft,(i°P*)| : 

+ |X^(i"PP)/iV-X^(<°P*)/iV| 

+ |X^(i°P')/7V - PlsDl + - • 

Applying Lemma [TU] to the first term of the expression. Corollary [TS] to the second. Lemma [T5I to 
the fourth. Lemma [TU] to the fifth, and bounding the third term by rjX^^{t*)/N < rj from above 
we get: 

|M5/i(t^PP) - fisHe'P')] < LfisS + 7] + 4Lns5 + + r; + Lfis5 
= 377 + GL^sS ■ 

Then in Case 1 and in Case 2 ft.(i^PP) > h{t°P^) - Sry/jUa - 6LS. Observe that this error bound for 
^app (jjffgj-g from the bound for t* in Lemma [T6l onlv by a constant in the summand rj/ fis- Therefore 
we can complete the proof of the Theorem 2] by the same arguments as in the proof of Theorem [1] 
with values of 6 and 77 that differ from Equations (O, ([1]), and (O only by constants. □ 

5 Open Questions 

We showed that our algorithm is a relative error approximation under the assumption that the 
maximal area of overlap of the two input shapes is at least a small constant fraction of one of the 
shapes. This is a reasonable assumption, but nevertheless it would be interesting to know whether 
the algorithm is a relative error approximation without further assumptions on the shapes. 

Furthermore, it might be reasonable to use measure theoretic methods for the analysis of the 
algorithm. We can show that Xf^{t)/N converges almost surely uniformly to P{B\) on T, using 
the Uniform Law of Large Numbers II, as stated in [S]. But it appears difficult to determine the 
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convergence rate to deduce bounds on the required number of random experiments by measure 
theoretic methods. 

It is an interesting question how to apply our probabihstic technique to matching shapes under 
similarity maps. The straightforward technique of choosing a pair of random points from A and 
one from B and giving a vote to the corresponding similarity map does not lead to the desired 
result. 

Furthermore, choosing three points in each shape defines a unique afiine transformation that 
maps the points onto each other, so there is a canonical version of the algorithm for affinc trans- 
formations. It would be interesting to know whether this algorithm computes something useful if 
the parametrization of the space and the definition of the neighborhoods are chosen cleverly. 
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